AITopics | feedback training

Collaborating Authors

feedback training

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

19ca14e7ea6328a42e0eb13d585e4c22-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-11-2026, 14:36:59 GMT

While harmonic mean score is increased from 52.2% to 62.2% on AwA1, there are less drastic changes between (a) and (b). Reviewer#2 - 1) Notation As recommended, we will reexamine the notations and try to modify them to simple8 forms. Therefore, in our opinion, unknown attributes would be the better assumption for15 zero-shot problems butitisstillworthstudying withknownattributessimilar toother worksmentioned above.(a-e)16

artificial intelligence, feedback training, reviewer, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

Reviewer 1

Neural Information Processing SystemsOct-2-2025, 06:18:22 GMT

We hope it will be better supplements than recommended, then we will include this figure. Figure 1: Structure visualization of learned dataset A wA1,2. While harmonic mean score is increased from 52.2% to 62.2% on A wA1, there are less drastic changes between (a) and (b). To generate missing datapoints by implementing Eqn. We will add this additional explanation in our paper.

artificial intelligence, datapoint, machine learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

Distributed Gossip-GAN for Low-overhead CSI Feedback Training in FDD mMIMO-OFDM Systems

Cao, Yuwen, Liu, Guijun, Ohtsuki, Tomoaki, Yang, Howard H., Quek, Tony Q. S.

arXiv.org Artificial IntelligenceSep-16-2025

The deep autoencoder (DAE) framework has turned out to be efficient in reducing the channel state information (CSI) feedback overhead in massive multiple-input multipleoutput (mMIMO) systems. However, these DAE approaches presented in prior works rely heavily on large-scale data collected through the base station (BS) for model training, thus rendering excessive bandwidth usage and data privacy issues, particularly for mMIMO systems. When considering users' mobility and encountering new channel environments, the existing CSI feedback models may often need to be retrained. Returning back to previous environments, however, will make these models perform poorly and face the risk of catastrophic forgetting. To solve the above challenging problems, we propose a novel gossiping generative adversarial network (Gossip-GAN)-aided CSI feedback training framework. Notably, Gossip-GAN enables the CSI feedback training with low-overhead while preserving users' privacy. Specially, each user collects a small amount of data to train a GAN model. Meanwhile, a fully distributed gossip-learning strategy is exploited to avoid model overfitting, and to accelerate the model training as well. Simulation results demonstrate that Gossip-GAN can i) achieve a similar CSI feedback accuracy as centralized training with real-world datasets, ii) address catastrophic forgetting challenges in mobile scenarios, and iii) greatly reduce the uplink bandwidth usage. Besides, our results show that the proposed approach possesses an inherent robustness.

artificial intelligence, machine learning, nmse performance, (16 more...)

arXiv.org Artificial Intelligence

2509.1049

Country: Asia (0.46)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model

He, Zhiwei, Wang, Xing, Jiao, Wenxiang, Zhang, Zhuosheng, Wang, Rui, Shi, Shuming, Tu, Zhaopeng

arXiv.org Artificial IntelligenceJan-23-2024

Insufficient modeling of human preferences within the reward model is a major obstacle for leveraging human feedback to improve translation quality. Fortunately, quality estimation (QE), which predicts the quality of a given translation without reference, has achieved impressive alignment with human evaluations in the last two years. In this work, we investigate the potential of employing the QE model as the reward model (the QE-based reward model) to predict human preferences for feedback training. We first identify the overoptimization problem during QE-based feedback training, manifested as an increase in reward while translation quality declines. We examine the problem and argue that the vulnerability of the QE model might lead to high rewards for incorrect translations, resulting in overoptimization and error propagation. To address the problem, we adopt a simple yet effective method that uses heuristic rules to detect the incorrect translations and assigns a penalty term to the QE-based rewards for the detected incorrect translations. Experimental results show that the proposed QE-based feedback training achieves consistent and significant improvements across various settings, further verified through human preference studies. Our subsequent analysis demonstrates the high data efficiency of the proposed QE-based feedback training: the proposed approach using a small amount of monolingual data can outperform systems using larger parallel corpora.

computational linguistic, reward model, translation, (12 more...)

arXiv.org Artificial Intelligence

2401.12873

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)
(14 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback